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Abstract 

Background: Fragile X Syndrome (FXS) and its associated disorders are caused by the expansion of the CGG repeat in 
the 5' untranslated region of the fragile X mental retardation 1 {FMRl) gene, with disease classification based on the 
number of CGG repeats. The mechanisms of repeat expansion are dependent on the presence of cis elements and the 
absence of trans factors both of which are not mutually exclusive and contribute to repeat instability. Expansions 
associated with trons factors are due to the haploinsuffient or reduced expression of several DNA repair/metabolizing 
proteins. The reduction of expression in trans factors has been primarily conducted in animal models without substantial 
examination of many of these expansion mechanisms and trans factors in humans. 

Results: To understand the trans factors and pathways associated with trinucleotide repeat expansion we have analyzed 
two microarray datasets which characterized the transcript expression in patients with FXS and in controls. 

Conclusion: We observed significant down regulation of DNA damage/repair pathway transcripts. This observation was 
consistent in both datasets, which used different populations. Within these datasets, several transcripts overlapped in the 
direction of association and fold change. Further characterization of these genes will be critical to understand their role 
in trinucleotide repeat instability in FXS. 
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Background 

Fragile X Syndrome (FXS, OMIM 300624) is caused by a 
mutation in fragile X mental retardation 1 {FMRl), 
Prevalence estimates for FXS in the general Caucasian 
population is ~1 out of 4000 males and ~1 out of 8000 
females [1,2]. This prevalence rate has been subsequently 
substantiated by other reports, thus the rate is generally 
regarded as the prevalence rate in a randomly mating 
population [3]. FXS is a spectrum disorder in which af- 
fected individuals have IQs ranging from low or moder- 
ate to high functioning [4,5]. In over 98% of patients, 
FXS is caused by expansion of the CGG repeats in the 5' 
untranslated region of FMRl located adjacent to exonl 
on the X chromosome [6,7]. The CGG repeat region can 
be grouped into four general allelic forms, based on the 
CGG repeat length and stability, during transmission 
from parent to child. The allelic forms include common 
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variants containing 6-40 repeats; intermediate variants, 
sometimes termed gray zone alleles, containing 41-54 
repeats; premutation variants containing 55-199 repeats; 
and the full mutation variants, containing >200 repeats 
[1]. When the expansion exceeds 200 CGG repeats (i.e. 
full mutation), it causes methylation of the FMRl regula- 
tory region, which induces transcriptional silencing [8]. 
This CGG repeats are periodically interspersed with 
AGG interruptions; interestingly the presence of these 
AGG interruptions in the CGG repeat have been shown 
to reduced the risk of transmission of a full mutation to 
offspring [9]. The transmission of the FXS full muta- 
tion is usually maternally derived because sperm from 
males with a premutation or a full mutation only carry 
premutation alleles; however one case of a premutation 
male who transmitted a full mutation to his daughter is 
reported in the literature [10]. 

Similar to other trinucleotide expansion disorders (e.g., 
myotonic dystrophy, Huntington disease, and some 
spinocerebellar ataxias) FXS CGG repeat expansions are 
locus specific; that is, genome-wide instability is not 
observed. This suggests that the mechanism of repeat 
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expansions might not be caused by mutations in the 
DNA repair proteins or other trans acting factors, as 
such factors typically lead to genome-wide instability 
(e.g., several colorectal cancers and diseases [11]). 

Other mechanisms have been proposed to produce tri- 
nucleotide repeat expansions that do not require a muta- 
tion phenotype [12]. One mechanism suggests the 
alternation of the stoichiometric amount of protein 
product needed to maintain genomic integrity could 
underlie trinucleotide repeat expansions without the 
mutation phenotype. For example, a reduction in the 
stoichiometric ratio of enzymes critical for DNA repair/ 
replication leading to locus-specific expansions has been 
investigated in animal models [13,14]. Haploinsufficiency 
has mainly focused on the proteins in the ATM/ATR 
pathway to understand the intergenerational expansions 
of CGG repeats [13,14]. Reports have clearly demon- 
strated that haploinsufficiency of ATR and ATM (kinases 
that function in resolution of stalled replication forks 
and in double strand breaks, respectively) leads to in- 
creased intergenerational expansion of CGG repeats 
with a maternal and paternal bias, respectively [13,14]. 
Corroborating evidence of haploinsufficient expression 
of DNA repair/replication protein transcripts has not 
been reported in humans. Recently, expression analysis 
of transcripts has occurred in patients with FXS [15-18]. 
We used two datasets to determine if expression tran- 
scripts of DNA repair/replication enzymes were signifi- 
cantly altered in patients with FXS compared to controls. 
We observed that within both datasets there was sig- 
nificant down regulation of several enzymes with char- 
acterized functions in DNA repair and replication. In 
addition, both datasets also showed an overlap of several 
proteins important in maintaining genomic integrity. 

Methods 

Genes in DNA repair/replication pathways were com- 
piled from the REACTOME pathway database (www. 
reactome.org). FXS expression array data were obtained 
from two sources. First, microarray expression dataset 
GSE7329 was downloaded from the National Center for 
Biotechnology Information Gene Expression Omnibus 
(www.ncbi.nlm.nih.gov/geo/) database [16]. In this dataset, 
expression profiles of blood-derived lymphoblastoid cells 
from males with autism due to a fragile X mutation 
(FMRl-FM), or autism due to a 15qll-ql3 duplication 
(dup(15q)), or individuals without autism spectrum dis- 
orders (i.e. controls), were studied using Agilent Whole 
Human Genome Oligo Microarray G4112A. Probes cor- 
responding to the compiled list of genes in DNA repair/ 
replication pathways were identified on the Agilent 
Whole Human Genome Array. Log (base 10) transformed 
expression values were extracted from the downloaded 
GSE7329 dataset directly. Analysis of Variance (ANOVA), 



followed with specific contrast between FXS and control 
groups, was conducted in Partek Genomics Suite 6.5 
(Partek, Inc., St. Louis, Missouri, USA). Second, gene ex- 
pression related to DNA repair/replication pathways were 
also assessed using data from our collaborators. Recently, 
Rosales-Reynoso and colleagues published a gene expres- 
sion profiling study of the total peripheral blood from 10 
male patients with FXS and 10 controls [15]. Two-color 
Human Genome Microarray (MWG Biotech H10K_DB) 
from the Physiology Laboratory of the Universidad 
Nacional Autonoma de Mexico (http://microarrays. 
ifc.unam.mx) was used for this study. Target prepar- 
ation, hybridization, and initial data collection were 
performed according to the Physiology Laboratory's 
in-house protocol. Signal quantification and normalization 
were determined using the Array-Pro Analyzer 4.0 soft- 
ware for microarray images (Media Cybernetics, L.P., 
Silver Spring, MD). Details about the experiment 
protocol have been described previously [15]. Probes 
corresponding to the compiled list of genes in DNA 
repair/replication pathways were extracted from the 
dataset. One sample ^-test was conducted for the Log 
transformed expression intensity ratios between pairs 
of patients with FXS and controls in Partek Genomics 
Suite 6.5. False discovery rate control (FDR < 0.05) 
was applied to indentify the differentially regulated 
genes. 

Results 

Rosales-Reynoso and colleagues reported significant down 
regulation of Rad9A transcript, a DNA repair/cell cycle 
check point protein within the ATR/ ATM DNA repair 
pathway, and up regulation of MSH6 (DNA mismatch re- 
pair binding protein) in patients with FXS [15]. A decrease 
in Rad9A transcripts could significantly impair DNA 
repair/replication pathway which could lead to repeat ex- 
pansion. To further investigate the haploinsufficiency hy- 
pothesis, we examined the microarray datasets looking 
specifically at genes in DNA repair/replication pathways 
according to the curated REACTOME pathway database 
(www.reactome.org). A total of 312 genes involved in DNA 
repair and replication were obtained from REACTOME. 
Among them, 277 genes have corresponding probes (522 
probes) covered by the Agilent Whole Human Genome 
Array G4112A which was used in the GEO dataset. Ana- 
lysis of Variance (ANOVA) shows that 97 genes (35.0%) 
were differentially regulated between FXS samples and 
controls. Interestingly, the majority of the 97 genes (63 
genes, 64.9%) were significantly down regulated (p < 0.05). 
After correction for multiple testing, 45 genes (46.4%) 
passed the false discovery rate threshold (FDR < 0.05). The 
majority of these genes (33 out of 45 genes, 73.3%) were 
down regulated (Figure 1). Expression data for several tran- 
scripts are shown in Figure 1, including down regulated 
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Figure 1 Cluster heatmap showing expression profiles of DNA repair/replication related genes in individuals with fragile X Syndrome (FX) 
and control subjects (con). Individual transcripts are shown on the x-axis and individual subjects are shown on the y-axis. The gene expression 
intensities are color-coded with red being high and blue being low, as indicated by the color scale to the right. Fifty-six transcripts from 45 genes were 
used for an unsupervised Pearson cluster analysis. These 45 genes demonstrated differential regulation between patients with FXS and control subjects 
according to ANOVA analysis with FDR < 0.05. Similarities between subjects or transcripts are represented by the distances between branches to the left 
and on the top, respectively. The closer the branches, the more similar the expression profiles of the corresponding subjects/transcripts. 



ATM, Rad9A, and OGGl genes that are implicated in tri- 
nucleotide expansion [12,13]. To corroborate these find- 
ings, we used the dataset from a Mexican population with 
FXS [15]. This dataset contained 175 unique genes repre- 
sented on the arrays by 364 probes. Similar to the GEO 
dataset, the majority of genes in the DNA repair/replica- 
tion pathway were down regulated. 109 genes were differ- 
entially regulated between patients with FXS and controls 
ip < 0.05). 92 genes passed the false discovery rate thresh- 
old (FDR < 0.05). Of the genes that passed the false discov- 
ery rate threshold, 89 were down regulated with 60 genes 
having greater than 1.5-fold changes. We compared the 
overlap of genes between the two datasets and found that 
16 genes were present within both datasets which had 
similar fold change (Table 1). 

Discussion 

Taken together, these data suggest differential expression 
of transcripts in the DNA repair/replication in patients 
with FXS compared to controls. Within patients with 
FXS there is considerable heterogeneity in the expres- 
sion of many of the DNA repair/replication proteins 
(Figure 2). This could suggest that differential expression 
of several transcripts could lead to repeat expansion in 



Table 1 DNA damage/repair genes which overlap in both 
datasets with similar fold-change and direction of 
association 



Gene 
symbol 


p-value 


FDR corrected 
p-value 


Fold change 
(FX vs. con) 


Fold-change 

direction 
(FX vs. con) 


BRCA2 


0.004 


0.01 


-1.17 


down 


CDKNIA 


0.02 


0.03 


-1.11 


down 


ERCCl 


0.0003 


0.04 


-1.06 


down 


FANCC 


0.02 


0.001 


-1.08 


down 


NUP98 


0.002 


0.01 


-1.10 


down 


PSMBl 


0.02 


0.02 


-1.04 


down 


PSMD9 


0.0004 


0.03 


-1.08 


down 


PSME2 


0.0001 


0.008 


-1.07 


down 


PSME3 


0.0005 


0.003 


-1.10 


down 


RFC1 


0.003 


0.02 


-1.06 


down 


BUBS 


0.02 


0.009 


1.10 


up 


POLB 


0.007 


0.01 


1.08 


up 


PPP2R5A 


0.02 


0.02 


1.05 


up 


PSMDIO 


0.005 


0.01 


1.03 


up 


RAD 17 


0.01 


0.01 


1.05 


up 


RANBP2 


0.03 


0.05 


1.04 


up 
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Figure 2 Signal intensity Log ratios. Differential expression of Rad9A (blacl< bars), ATM (wliite bars), and OGGl (gray bars) in cell lines containing 
FMRl full mutation (FM) and controls. Rad9A and OGGl showed decreased expression of transcript in FMRl-FM cell lines compared to controls. In 
a few FMRl-FM cell lines we observed a significant decrease in ATM (GSMl 76575, GSMl 76586, and GSMl 76589); however, the expression of ATM 
was not significantly different from FMRl-FM and controls. 



some families essentially these families could be more 
susceptible to gene specific repeat expansions [19]. The 
down regulated transcripts could also suggest that there 
are several different pathways or combinations of down 
regulated transcripts which could lead to repeat expan- 
sion. The outcome of CGG expansion could occur 
through different mechanisms within individuals as sug- 
gested by animal models [13]. In fact, it is suggested that 
females transmit large expansions which appear to be re- 
pair dependent whereas males selectively delete large ex- 
pansion and transmit small expansions which are 
replication dependent [12]. This observation is supported 
by haploinsufficient animal models [12,13]. Based on ani- 
mal models and human data, we would suggest first char- 
acterizing the CGG expansion within the mother [if 
possible], with subsequent analysis in offspring by stratifi- 
cation of family members with premutations or FXS based 
on gender (e.g. paternal or maternal origin). In essence this 
method, previously used by Nolin and colleagues would 
allow for multi-generational characterization of the repair/ 
replication pathways essential for expansion [19]. 

Transcript expression data may not have a linear asso- 
ciation with translation; subsequent in vitro follow-up is 
needed to determine whether differential expression of 
transcripts in the DNA repair/replication pathway corre- 
sponds to alteration of protein levels in these enzymes. 
Further investigations are needed to provide models 



which couple decreased expression of DNA repair/repli- 
cation enzymes with the mechanism of trinucleotide 
repeat expansion in patients with FXS. Other evidence 
from mouse models also shows a maternal and paternal 
bias in the inheritance of expanded repeats based on 
specific DNA replication/repair enzyme [12,13]. In 
addition, familial studies of intergeneration expansion 
have indicated evidence of different expansion rates [18]. 
This suggests that characterizing the mode of inherit- 
ance of the CGG expansion (i.e., whether primarily ma- 
ternal or paternal) would be significant in understanding 
the inheritance. In effect, the mode of inheritance would 
provide significant data leading to an understanding of 
the enzymes which are significantly associated with ex- 
pansion pathways. 

Currently, only around 16 genes were found to be differ- 
entially regulated with the direction of association similar 
in both datasets. The limited number of genes found 
through this analysis is due to the fact that two different 
microarray experiments did not have a large number of 
overlapping genes for the DNA damage/repair pathway 
from REACTOME. We found that the degree of overlap 
between both datasets was around 50%. However, within 
this limited amount of data we found significant associa- 
tions which we hope can be replicated by others. Overall, 
these findings indicate that the DNA damage/repair path- 
ways could significantly contribute to repeat instability. 



Xu et a I. BMC Research Notes 201 3, 6:90 
httpy/www.biomedcentral.com/l 756-0500/6/90 



Page 5 of 5 



The current understanding of trinucleotide expansion 
disorders suggests that many of these expansions arise 
from several different mechanisms [12]. A second level of 
complexity would also suggest that the DNA repair/repli- 
cation mechanisms could have considerable cross talk [12]. 
Understanding the mechanism of trinucleotide repeat ex- 
pansion in FXS would be beneficial to understanding other 
trinucleotide repeat expansion disorders. Finally, the evolu- 
tionary significance of loci-specific repeat expansion disor- 
ders should not be understated. Evidence from this field 
could engender greater understanding of the evolution of 
the human genome and a greater understanding of how 
genome fidelity is maintained. 
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